Quantum theory as the most robust description of reproducible experiments 
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I. INTRODUCTION 

Quantum theory has proven to be extraordinary powerful 
to describe a vast amount of very different experiments in 
(sub)-atomic, molecular and condensed matter physics, quan- 
tum optics and so on. Remarkably, the reason for the success 
of quantum theory is not understood at all. 

The present work is an attempt to fill this void. It is not con- 
cerned with the various interpretations [1-4] of quantum the- 
ory but it explores the possibility of exploiting logical infer- 
ence to give a rational explanation for the success of quantum 
theory as a description of a vast class of physical phenomena. 

We introduce the basic ideas of our approach by starting 
with a few quotes of Niels Bohr: 

1 . There is no quantum world. There is only an abstract 
physical description. It is wrong to think that the task 
of physics is to find out how nature is. Physics concerns 
what we can say about nature lUt]. 

2. Physics is to be regarded not so much as the study of 
something a priori given, but rather as the development 
of methods of ordering and surveying human experi- 
ence. In this respect our task must be to account for 
such experience in a manner independent of individual 
subjective judgment and therefore objective in the sense 
that it can be unambiguously communicated in ordinary 
human language 

3. The physical content of quantum mechanics is ex- 
hausted by its power to formulate statistical laws gov- 
erning observations under conditions specified in plain 
language |i6|]. 
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The first two sentences of the first quote may be read as a sug- 
gestion to dispose of, in Mermin's words JtJ, the "bad habit" 
to take mathematical abstractions as the reality of the events 
(in the everyday sense of the word) that we experience through 
our senses. The last two sentences of the first and the second 
quote suggest that we should try to describe human experi- 
ences (confined to the realm of scientific inquiry) in a manner 
and language which is unambiguous and independent of the 
individual. Of course, the latter should not be construed to 
imply that the observed phenomena are independent of the 
choices made by the individual(s) in performing the scientific 
experiment fsi. 

The third quote suggests that quantum theory is a powerful 
language to describe a certain class of statistical experiments 
but remains vague about the properties of the class. Similar 
views were expressed by other fathers of quantum mechanics, 
e.g.. Max Born and Wolfgang Pauli tgd. They can be sum- 
marized as "Quantum theory describes our knowledge of the 
atomic phenomena rather than the atomic phenomena them- 
selves". Our aim is, in a sense, to replace the philosophical 
components of these statements by well-defined mathemati- 
cal concepts and to carefully study their relevance for physical 
phenomena. Specifically, by applying the general formalism 
of logical inference to a well-defined class of statistical exper- 
iments, the present paper shows that quantum theory is indeed 
the kind of language envisaged by Bohr 

Theories such as Newtonian mechanics. Maxwell's elec- 
trodynamics, and Einstein's (general) relativity are deductive 
in character Starting from a few axioms, abstracted from 
experimental observations and additional assumptions about 
the irrelevance of a large number of factors for the descrip- 
tion of the phenomena of interest, deductive reasoning is used 
to prove or disprove unambiguous statements, propositions, 
about the mathematical objects which appear in the theory. 

The method of deductive reasoning conforms to the 
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Boolean algebra of propositions. The deductive, reduction- 
ist methodology has the appealing feature that one can be sure 
that the propositions are either right or wrong, and disregard- 
ing the possibility that some of the premises on which the 
deduction is built may not apply, there is no doubt that the 
conclusions are correct. Clearly, these theories successfully 
describe a wide range of physical phenomena in a manner and 
language which is unambiguous and independent of the indi- 
vidual. 

At the same time, the construction of a physical theory, and 
a scientific theory in general, from "first principles" is, for 
sure, not something self-evident, and not even safe. Our ba- 
sic knowledge always starts from the middle, that is, from the 
world of macroscopic objects. According to Bohr, the quan- 
tum theoretical description crucially depends on the existence 
of macroscopic objects which can be used as measuring de- 
vices. For an extensive analysis of the quantum measurement 
process from a dynamical point of view see Ref. [fio!]. Most 
importantly, the description of the macroscopic level is robust, 
that is, essentially independent on the underlying "more fun- 
damental" picture ifllll . As will be seen later, formalizing the 
notion of "robustness" is key to derive the basic equation of 
quantum theory from the general framework of logical infer- 
ence. 

Key assumptions of the deductive approach are that the 
mathematical description is a complete description of the ex- 
periment under consideration and that there is no uncertainty 
about the conditions under which the experiment is carried 
out. If the theory does not fully account for all the relevant 
aspects of the phenomenon that we wish to describe, the gen- 
eral rules by which we deduce whether a proposition is true or 
false can no longer be used. However, in these circumstances, 
we can still resort to logical inference [I12I - I15I1 to find useful 
answers to unambiguous questions. Of course, in general it 
will no longer be possible to say whether a proposition is true 
or false, hence there will always remain a residue of doubt. 
However, as will be shown, the description obtained through 
logical inference may also be unambiguous and independent 
of the individual. 

In the present paper, we demonstrate that the basic equa- 
tions of quantum theory directly follow from logical inference 
applied to experiments in which there is 

(i) uncertainty about individual events, 

(ii) the stringent condition that certain properties of the col- 
lection of events are reproducible, meaning that they are 
robust with respect to small changes in the conditions 
under which the experiments are carried out. 

It is the latter that renders the theoretical description unam- 
biguous and independent of the individual. In addition, our 
work provides a rational foundation for Bohr's philosophical 
viewpoints embodied in quotes (1-3). 

The paper is structured as follows. Section HI] contains a 
brief introduction to the algebra of logical inference lll2l - fl6ll . 
a mathematical framework |[l2l - [l6ll which formalizes the pat- 
terns of plausible reasoning exposed by Poly a UtIi . This 
mathematically precise formalism expresses what most peo- 
ple would consider to be rational reasoning. The key concept 



is the notion of the plausibility that a proposition is true given 
that another proposition is true. Section |III]discusses the role 
of uncertainties in experiments and classifies their theoretical 
descriptions. In Section HVl we show in detail how logical in- 
ference can be used to derive the quantum theoretical descrip- 
tion of the Einstein-Podolsky-Rosen-Bohm thought experi- 
ment without invoking a single concept of quantum theory. 
Section|V]uses the Stern-Gerlach experiment to illustrate how 
the approach of Section|II]may be extended by adding features 
abstracted from the experiment. These two sections are based 
on earlier attempts to derive the expressions of quantum the- 
ory by logical inference lUslfToll . Finally, we demonstrate that 
the time-independent (Section IVIl l and time-dependent (Sec- 
tion IVIIt Schrodinger equation can be derived by logical in- 
ference from the assumption that the experiment yields repro- 
ducible data. A discussion of general aspects of our approach 
and conclusions are given in Section lVIIII 



II. THE ALGEBRA OF LOGICAL INFERENCE 

Obviously, any attempt to capture the process of human 
reasoning by which the events are registered by our senses 
and are brought in relation to each other, leading to abstract 
concepts, is bound to create more problems than we can ever 
solve. However, if we are only concerned about quantifying 
the truth of a proposition given the truth of another proposi- 
tion, it is possible to construct a mathematical framework, an 
extension of Boolean logic, that allows us to reason in a man- 
ner which is unambiguous and independent of the individual, 
in particular if there are elements of uncertainty in the descrip- 
tion liMl]. 

In this section, we briefly introduce the concepts that are 
necessary for the purpose of the present paper For a detailed 
discussion of the foundations of plausible reasoning, its rela- 
tion to Boolean logic and the derivation of the rules of logical 
inference, the reader is advised to consult the papers lll2[|l6ll 
and books lfl3l - fl5ll from which our summary has been ex- 
tracted. 

We start by listing three so-called "desiderata" from which 
the algebra of logical inference can be derived MM^- The 
formulation which follows is taken from Ref. III6I1 . 

Desideratum 1. Plausibilities are represented by real num- 
bers. The plausibility that a proposition A is 
true conditional on proposition B being true 
will be denoted by P{A\B). 

Desideratum 2. Plausibilities must exhibit agreement with 
rationality. As more and more evidence 
supporting the truth of a proposition be- 
comes available, the plausibility should in- 
crease monotonically and continuously and 
the plausibility of the negation of the propo- 
sition should decrease monotonically and 
continuously. Moreover, in the limiting 
cases that the proposition A is known to be 
either true or false, the plausibility P{A\B) 
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should conform to the rules of deductive rea- 
soning. In other words, plausibilities must 
be in qualitative agreement with the pat- 
terns of p lausible reasoning uncovered by 
Polyalil. 

Desideratum 3. All rules relating plausibilities must be con- 
sistent. Consistency of rational reasoning 
demands that if the rules of logical inference 
allow a plausibility to be obtained in more 
than one way, the result should not depend 
on the particular sequence of operations. 

These three desiderata only describe the essential features of 
the plausibilities and definitely do not constitute a set of ax- 
ioms which plausibilities have to satisfy. It is a most re- 
markable fact that these desiderata suffice to uniquely deter- 
mine the set of rules by which plausibilities may be manipu- 
lated iliMl. 

Omitting the derivation, it follows that plausibilities may be 
chosen to take numerical values in the range [0, 1] and obey 
the rules JiMl 

a. P{A\Z) + P{A\Z) = 1 where A denotes the negation of 
proposition A and Z is a proposition assumed to be true. 

b. P{AB\Z) = P{A\BZ)P{B\Z) = P{B\AZ)P{A\Z) where the 
"product" BZ denotes the logical product (conjunction) of 
the propositions B and Z, that is the proposition BZ is true if 
both B and Z are true. This rule will be referred to as "prod- 
uct rule". It should be mentioned here that it is not allowed 
to define a plausibility for a proposition conditional on the 
conjuction of mutual exclusive propositions. Reasoning on 
the basis of two of more contradictory premisses is out of 
the scope of the present paper. 

c. P{AA\Z) = and P{A + A|Z) = 1 where the "sum" A + 
B denotes the logical sum (inclusive disjunction) of the 
propositions A and B, that is the proposition A+B is true 
if either A or B or both are true. These two rules show that 
Boolean algebra is contained in the algebra of plausibili- 
ties. 

The rules (a-c) are unique. Any other rule which applies to 
plausibilities represented by real numbers and is in conflict 
with rules (a-c) will be at odds with rational reasoning and 
consistency lfl4l - [l6ll . 

The reader will no doubt recognize that rules (a-c) are 
identical to the rules by which we manipulate probabili- 
ties ifTsl l20l - l22ll . However, the rules (a-c) where not postu- 
lated. They were derived from general considerations about 
rational reasoning and consistency only. Moreover, concepts 
such as sample spaces, probability measures etc., which are 
an essential part of the mathematical foundation of probabil- 
ity theory [21, 22], play no role in the derivation of rules (a-c). 
In fact, if Kolmogorov's axiomatic formulation of probability 
theory would have been in conflict with rules (a-c), we be- 
lieve that this formulation would long have been disposed of 
because it would yield results which are in conflict with ra- 
tional reasoning. Perhaps most important in the context of 



quantum theory is that in the logical inference approach un- 
certainty about an event does not imply that this event can 
be represented by a random variable as defined in probability 
theory {23]. Definitely, uncertainty is not the same as "ran- 
domness". 

We emphasize that there is a significant conceptual dif- 
ference between "mathematical" probabilities and plausibil- 
ities. Mathematical probabilities are elements of an axiomatic 
framework which complies with the algebra of logical infer- 
ence. Plausibilities are elements of a language which also 
complies with the algebra of logical inference and serve to 
facilitate communication, in an unambiguous and consistent 
manner, about phenomena in which there is uncertainty. 

The plausibility P{A\B) is an intermediate mental construct 
that serves to carry out inductive logic, that is rational reason- 
ing, in a mathematically well-defined manner lfl4ll . In general, 
P{A\B) may express the degree of believe of an individual that 
proposition A is true, given that proposition B is true. How- 
ever, in the present paper, we explicitly exclude applications 
of this kind because they do not comply with our main goal, 
namely to describe phenomena "in a manner independent of 
individual subjective judgment", see Bohr's quote (2). 

To take away this subjective connotation of the word 
"plausibility", from now on we will simply call P{A\B) 
the "inference-probability" or "i-prob" for short. 

The algebra of logical inference is the foundation for 
powerful tools such as the maximum entropy method and 
Bayesian analysis lfl4l [Tsll . Although not formulated in 
the language of logical inference used in the present paper, 
Jaynes' papers on the relation between information and (quan- 
tum) statistical mechanics ll23ll24ll are perhaps the first to "de- 
rive" theoretical descriptions using this general methodology 
of scientific reasoning. As we show in this paper, quantum 
theory also derives from the application of the algebra of log- 
ical inference. In fact, the latter allows us to do exactly what 
Bohr suggested, namely "to account for such experience in a 
manner independent of individual subjective judgements and 
therefore objective in the sense that it can be unambiguously 
communicated in ordinary human language fl". Therefore, 
our derivation supports Bohr's philosophical viewpoint that 
"the physical content of quantum mechanics is exhausted by 
its power to formulate statistical laws governing observations 
under conditions specified in plain language" [|6l]. 

It is important to keep in mind that the rules of logical in- 
ference are not bound by "the laws of physics". In particular, 
logical inference also applies to situations where there are no 
causal relations between the events ITillTsll . From our point 
of view, the laws of physics provide a consistent description 
of relations between certain events that we perceive by our 
senses and therefore they should conform to the rules of logi- 
cal inference, not vice versa. Although extracting cause-and- 
effect relationships from empirical evidence by rational rea- 
soning should follow the rules of logical inference, in general 
the latter cannot be used to establish cause-and-effect relation- 
ships |15, 25, 26]. 

A comment on the notation used throughout this paper is 
in order To simplify the presentation, we make no distinction 
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between an event such as "detector D has fired", and the cor- 
responding proposition "D = detector D has fired". If we have 
two detectors, say Dx where x ~ ±1, we write P{x\Z) to de- 
note the i-prob of the proposition that detector D, fires, given 
that proposition Z is true. Similarly, the i-prob of the proposi- 
tion that two detectors Dx and D'y fire, given that proposition 
Z is true, is denoted by P{x,y\'Z,). Obviously, this notation 
generahzes to more than two propositions. 



III. QUANTUM THEORY AS AN INSTANCE OF LOGICAL 
INFERENCE 

The theoretical description of "classical physics" applies to 
phenomena for which there is absolute certainty about the out- 
come of each individual experiment on each individual ob- 
ject ll27l - l29ll . In mapping the experimental data which are 
necessarily represented by a limited number of bits, that is by 
integers, onto the theoretical abstractions in terms of real num- 
bers, it is assumed that the necessarily finite precision of the 
experiment can be increased without limit, at least in princi- 
ple, and that there is a one-to-one mapping between the values 
of the variables in the theory and the values of the correspond- 
ing quantities measured in experiment. 

In real experiments there is always uncertainty about some 
factors which may or may not influence the outcome of the 
measurements. In the realm of classical physics, standard 
techniques of statistical analysis are used to deal with this is- 
sue. It is postulated that these "imperfections" in the experi- 
mental data are not of fundamental importance but are techni- 
cal in nature and can therefore be eliminated, at least in prin- 
ciple iHthii. 

Quantum theory is fundamentally different from classical 
theories in that there may be uncertainties about each individ- 
ual event, uncertainties which cannot be eliminated, not even 
in principle ll27l - l29ll . Clearly, this is a statement about the the- 
ory, not about the observed phenomena themselves. The out- 
come of a real experiment, be it on "classical" or "quantum" 
objects, is always subject to uncertainties in the conditions un- 
der which the experiment is carried out. However, this issue 
is not of direct concern to us here because we only want to 
explore whether the quantum theoretical description, not the 
phenomena themselves, can be derived from logical inference 
applied to certain thought experiments. 

Summarizing, we may classify theoretical abstractions of 
scientific experiments as follows: 

Category 1. The conditions under which the experiment is 
carried out are known and fixed for the dura- 
tion of the experiment and there is no uncertainty 
about each event. 

Category 2. Each event under known conditions is certain 
but the conditions under which the experiment 
is carried out may be uncertain. 

Category 3. There may be uncertainty about each event and 
the conditions under which the experiment is 
carried may be uncertain. 



A laboratory experiment always falls in category 3. In a strict 
sense, numerical experiments on a digital computer belong to 
category 1 . However, disregarding the fact that in the course 
of the numerical experiment the time-evolution of each in- 
dividual bit of the computer is completely determined and 
known, in practice, the complexity of the numerical simula- 
tion is often so large that the variables of interest may exhibit 
behavior that is similar to the one observed in experiments 
belonging to category 2 and 3. 

In the theoretical description of a real experiment, it makes 
sense to simplify matters by first exploring models that belong 
to category 1 (classical physics) and if no satisfactory descrip- 
tion is obtained to consider models of category 2 (classical 
physics supplemented with probability theory). If the latter 
fails to describe the experiment too, we can still try models in 
category 3. 

The fact that laboratory experiments always belong to cate- 
gory 3 has an important implication. A basic requirement for 
any scientific experiment is that the analysis of the data yields 
quantities (e.g. frequencies, averages, correlations, etc.) that 
exhibit a high degree of reproducibility. Only then it may 
make sense to attempt drawing scientifically meaningful con- 
clusions from these data. Clearly, this requirement restricts 
the uncertainties on the conditions under which the experi- 
ment is carried out. If these uncertainties fluctuate wildly with 
each measurement, it is unreasonable to expect reproducible 
results. 

Therefore, it seems justified to limit attention to a subset of 
theoretical models of category 3 which satisfies the following 
criteria: 

Category 3a. There may be uncertainty about each event. 

The conditions under which the experiment is 
carried out may be uncertain. The frequen- 
cies with which events are observed are repro- 
ducible and robust against small changes in the 
conditions. 

As we show in this paper, the rules of logical inference applied 
to models belonging to category 3a rather straightforwardly 
lead to the basic equations of quantum theory. 

The derivation has a generic structure. The first step is to 
list the features of the experiment that are deemed to be rel- 
evant and to introduce the i-probs of the individual events. 
The second step is to impose the condition that the experi- 
ment yields reproducible results, not on the level of individual 
events, but on the level of averages of many events. The result 
of the second step is a functional of the i-prob, the minimum 
of which yields an expression for the i-prob which is identical 
to the corresponding probability obtained from the quantum 
theoretical description of the experiment. 

IV. EINSTEIN-PODOLSKY-ROSEN-BOHM THOUGHT 
EXPERIMENT 

As a first illustration, we consider Bohm's version of the 
Einstein-Podolsky-Rosen thought experiment ll30l Isill . To 
head off possible misunderstandings, the derivation presented 
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FIG. 1. (Color online) Diagram of the EPRB thought experiment. 
The source 5, activated at times labeled by ij = 1,2, . . . ,A', sends a 
signal to the router Ri and another signal to the router J?2- Depending 
on the orientations of the routers, represented by unit vectors ai and 
32, the signal going to the left (right) is detected with 100% certainty 
by either 1 or£)_ i (Z)+.2 or£)_ 2)- 



in this section does not add anything to the ongoing discus- 
sions about locaHty, reaUsm, etc. in relation to the violation of 
Bell-like inequalities fMUlSl 

We choose the Einstein-Podolsky-Rosen-Bohm (EPRB) 
thought experiment as the first example because it seems to 
be the simplest nontrivial model for demonstrating how the 
logical inference approach works. Indeed, a straightforward 
application of the ideas of Section|lII]yieIds an expression for 
the i-prob to observe detection events which is identical to the 
probability distribution obtained from the quantum theoretical 
description in terms of the singlet state of two spin- 1/2 parti- 
cles ml Ell ■ 



A. Experiment 

The layout of the EPRB thought experiment is shown in 
Fig.[T] In contrast to the conventional quantum theoretical de- 
scription IsilliTll . we keep the number of assumptions about 
the experiment itself to a minimum. Specifically, we assume 
that 

a. Each time the source S is activated, it sends a signal to the 
left and another one to the right. For the present purpose, it 
is not necessary to make any assumption about the nature 
of or the correlation between these two signals. 

b. The observation station i ~ 1,2 contains a "router" 
which sends the signal to either detector D+j or detector 
D j. The decision to send the signal to either D+ , or 
D i depends on the directions a,- of the router a,- be- 
ing a three-dimensional unit vector. The orientations of the 



routers are relative to the fixed laboratory frame of refer- 
ence. 

c. The detectors register the signal and operate with 100% ef- 
ficiency, that is, if « = 1,2, .. . labels the time at which 
the source is activated, the firing of the detectors produces 
a pair of integers {x„,y,j} where x„ — ±1 {y„ — ±1) repre- 
sents the firing of D± i (Z)±,2)- 

The result of a run of the experiment for fixed ai and a2 is 
a data set of pairs 

T = {x,„yn\xn = ±Uyn = ±Un = 1, . . . ,A^}, (1) 
where is the total number of signal pairs emitted by the 
source. From the data set Eq. ([T), we compute the averages 

W-^L-' ' (y) = ^iy^^ (2) 

i= 1 /= 1 

the correlation, 

1 ^ 

!=1 

and coincidences 

N 

«xy = ^5v,.v,-5y,y,-! (4) 

/=1 

which represents the number of events of the type The 
assumptions (a-c) and Eqs. (|2|i - (|4li represent our perception 
about the experiment and specify the data analysis procedure, 
respectively. 

B. Inference-probability of the data produced by the 
experiment 

The next step is to formalize the general features of the pos- 
sible outcomes of the experiment. 

1. The i-prob to observe a pair {x,y} is denoted by 
P{x,y\ai,a2,Z) where Z represents all the conditions un- 
der which the experiment is performed, with exception of 
the directions ai and a2 of the routers Ri and R2, respec- 
tively. It is assumed that the conditions represented by Z 
are fixed and identical for all experiments. 

It is not difficult to see that any real-valued function f{x,y) 
of two dichotomic variables x,y = ±1 can be written as 



(l-x)(l-y)/(-l,~l) + (l+x)(l~y)/(+l,-l) + (l-x)(l+y)/(-l, +!) + (! +,r)(l+y)/(+l,+l) 

4 

/(-L-l)+/(+L-l)+/(-l,+l)+/(+L+l) -/(-1,-1) +/(+!, -1) -/(-I, +l)+/(+l,+l) 

4 " 4 

, +/(-!, +1) +/(+!, +1) /(-!,-!) -/(+!, -1) -/(-I, +l)+/(+l,+l) 

-^y 4 4 

(5) 
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From this general identity, it immediately follows that f (x,y|ai,a2,Z) can be written as 

„, I £0(31,32,2) +;c£'i (ai, 39, Z)+_y£'2(ai, 32, Z)+xy£'i2(ai, 32, Z) 

P(jc,y|3i,32,Z) = , (6) 

where 



£■0(31, 32, Z) 


= f(^,3'|3l,32,Z) = l, 




x,y=±\ 


£1(31, 32, Z) 


= L ■*^^(-*^'3'|ai,32,Z), 




A-,>-±l 


£2(31, 32, Z) 


= E >'^'(-^,3'|3l,32,Z), 




.v,y=±l 


£12(31, 32, Z) 


= L ■^3'^'(-^,3'|3l,32,Z). 

JC,>>=±1 



(7) 

Furthermore, from Eq. (|2) and rule (a), it follows directly that |£i(3i,32,Z)| < 1, |£2(3i,32,Z)| < 1, and |£i2(3i,32,Z)| < 1. 

I 

2. For simplicity, it is assumed that there is no relation be- In probability theory, events with these properties are called 

tween the actual values of the pairs {x,,,}?,,} and {x,,/,}?,,/} Bernoulli trials, a concegt that is central to many results in 

if n 7^ n' . In other words, as far as we know, each repetition probability theory Iil4[ [Tsl |22|l- Invoking the product rule, 

of the experiment represents an identical event of which the the logical consequence of this assumption is that 
outcome is logically independent of any other such event. 

I 



P{x\,yi,. . . ,;CAr,yAr|3i,32,Z) P{x\ ,y\ |JC2,3'2, ■ ■ ■ ,JCAf,yAr,3l ,32,Z)f(x2,3'2, • ■ ■ jXiVj^A- |3i , 32,Z) 

= ■f'(jCl ,^1 |3l,32,Z)P(jC2,3'2, • • • ,-«A',3'A'|3l,32,Z) 
= -P(jCl,3'l|3l,32,Z)P(jC2,y2|jC3,3'3,---,JCA',3'A',3l,32,Z) 

X-P(-«3,>'3, • • • ,JCA',3'A'|3l,32,Z) 
= P{x\,y\ |3i,32,Z)P(x2,}'2|3l,32,Z)P(x3,y3, . . . ,X7v,y7v|3i , 32,Z) 

N 

= n^(^''3''|ai,a2,Z), (8) 

1=1 

meaning that the i-prob P{x\,y\,. . . ,XM,yN\^l^^2^'Z') to observe the compound event {{xi^yi},...,{xN,yN}} is uniquely 
determined by the i-prob P(x,3'|3i ,32,Z) to observe the pair {x,}/}. 



3. It is assumed that the i-prob f (jc,3'|3i,32,Z) to observe a 
pair {x,y} does not change if we apply the same rotation 
to both routers ^1 and R2, while leaving the source S un- 
touched. This implies that 

P(x,3;|3i,32,Z) =P(x,3;|3i •32,Z) =P(x,3;|0,Z), (9) 

where Q = arccos(3i • 32) denotes the angle between the 
unit vectors 3i and 32. For any integer value of K, 6 + 2kK 
represents the same physical arrangement of the routers R\ 
and R2. 

4. It is assumed that observing x = +1 (y = +1) is as likely as 
observing x = — 1 (y = — 1), independent of the directions 
3i and 32. From Eq. Q, it follows that 



£l(3i,32,Z) =£2(3i,32,Z) =0. 



(10) 



Assumptions (3) and (4) formalize our expectations about the 
symmetries of the experimental setup. 

Using Eqs. (|6]l, (|9|l, and (fTOl i we find that the i-prob to ob- 
serve a pair {x,y} simplifies to 



P(x,y|0,Z) 



\+xyEn{e) 



(11) 



where £12(0) = £12(31, 32, Z) is a periodic function of Q. 



C. Condition for reproducibility 

Although the data set Eq. ([T]) changes from run to run, we 
expect that the averages Eq. (|2]i, the correlation Eq. (|3} and the 
coincidences Eq. (|4| exhibit some kind of robustness, smooth- 
ness with respect to small changes of 0. If this were not the 
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case, these numbers would vary erratically with 9, most likely 
the results would be called "irreproducible", and the exper- 
imental data would be disposed of because repeating the run 
with a slightly different value of 9 would often produce results 
that are very different from those of other runs. 

Obviously, the important feature of robustness with respect 
to small variations of the conditions under which the experi- 
ment is carried out should be reflected in the expression for 
the i-prob to observe data sets which yield reproducible aver- 
ages and correlations (with the usual statistical fluctuations). 
Having exploited all elementary knowledge about the exper- 
iment (see Sections IIV Al and IIV Bl ). the next step therefore 
is to determine the expression for P{x,y\9,Z) which is most 
insensitive to small changes in 9. 

Let us assume that for a fixed value of 9, an experimen- 
tal run of events yields «„• events of the type {x,y} where 

«++ + « ^ + «+_ +11 = A'. The number of different data 

sets yielding the same values of «++, « ^, «+-, and n is 

A^!/(n++)!(n_+)!(n+_)!(«__)!. According to Eq. ©, the i- 
prob that events of the type {x,y} occur n„ times is given by 
Ylx.Y=±i P{x,y\(^ 7^)"" ■ Therefore, the i-prob to observe the 
(compound) event {yi++,n !_,«+_,« } is given by 



P(«++,n_+,«+_,«__|0,A?,Z)=A?! n 

x,y=±l 



P{x,y\9,ZY'-'y 



If the outcome of the experiment is indeed described by 
the i-prob Eq. ( fTSl i and the experiment is supposed to yield 
reproducible results, small changes of 9 should not have a 
drastic effect on the outcome. So let us ask ourselves how 
the i-prob would change if the experiment is carried out with 
9 + E (£ small) instead of with 9. 

It is expedient to formulate this question as an hypoth- 
esis test. Let Hq and H\ be the hypothesis that the data 

{«++,«_+, «^ ,« } is observed if the angle between the 

unit vectors ai and a2 is 9 and + £, respectively. The ev- 
idence Ev of hypothesis Hi, relative to hypothesis Hq, is de- 
fined by Ollil 

Ev = ln^(";+'"-+'"+-'"-" + ^'^'^\ (13) 

where the logarithm serves to facilitate the algebraic manip- 
ulations. If H\ is more (less) plausible than Hq then Ev > 
(Ev < 0). 

Making use of Eq. (fT2b . we find 



^ P{x,y\9 + £,Z) 

Ev = ) «rv In ; r 



(14) 



(12) Writing Eq. ( fT4] l as a Taylor series in e we have 



Ev: 



.v,v=±l 



p'jxMe.z) £2 / p^(x,y|e,z) y p" {x,y\9 ,z) 

'P{x,y\9,Z) 2 \P{x,y\9,Z)) 2 P{x,y\9,Z) 



(15) 



where a prime denotes the derivative with respect to the vari- 
able 9. 

According to our criterion of robustness, the evidence 
Eq. (flSl l should change as little as possible as e varies. The 
contribution of the term in e can be made to vanish by substi- 
tuting Uxy = aP{x,y\9 ,Z). From 

N= «-vv = « L P{xM9,Z) = a, (16) 

x.y=±\ x.y=±\ 

it follows that a =N. Then we have 

x,y=±l 

-N^l^O. ill) 

Using the same reasoning, it follows that the third term in 
Eq. ( fTSb also vanishes. 

Although our choice P{x,y\9,Z) ~ n„/N is motivated by 
the desire to eliminate contributions of order e, it follows that 
our criterion of robustness leads us to the intuitively obvious 



procedure which assigns to P{x,y\9,Z) the value of the ob- 
served frequencies of occurrences iixy/N. As shown in the 
appendix, for large A^, the same procedure also follows from 
searching for the f (x,y|0,Z)'s which maximize the i-prob to 
observe {«++,«_+,«+_,« }. However, our criterion of ro- 
bustness yields more than just this assignment, namely it also 
determines the 0-dependence of the i-probs. 

Indeed, the second term in e cannot be made to vanish, ex- 
cept by assuming that P{x,y\9,Z) does not depend on 9, in 
which case the description can only apply to experiments for 

which } does not exhibit any dependence 

on 9. Excluding this fairly uninteresting set of experiments, 
the final expression for the evidence reads 

Fv- y 1 f dPixMe,z) Y 3 

(18) 

In order to minimize the variation of Eq. (ITFt as a function of 
e as much as possible, we should discard the prefactor —N/2 
and minimize 

wm- V 1 ( dp{xMe,zW 

which is the Fisher information for the problem at hand. 



Using Eq. ( fTTT i. we can rewrite Eq. iT9\ as 



If{0) = 



1 fdEn{9] 



(20) 



Without loss of generality, we may simplify Eq. ( l20l i by sub- 
stituting £12(0) = cosg{9), yielding 



(21) 



As mentioned earlier, the goal is to find the expression 
of the i-prob P{x,y\9,Z) or, equivalently, the function g{0) 
which minimizes the Fisher information Eq. (|20] |. This is tan- 
tamount to requiring that the variation of /f (0) 



5/f(0) 



fdg{e)\ fd5g{e) 



\ de J \ de 



(22) 



with respect to the variation 5g{e) vanishes, yielding g{e) = 
ae + where a and (p are constants. 

As £12(0) is a periodic function of we must have a^K 
where K is an integer and hence 



£12(0) = cos {Ke + ^). 



(23) 



From Eq. (|20ll it follows that /f(0) = K^, independent of 0. 
As mentioned earlier, we discard the solution /f(0) =0 be- 
cause it describes an uninteresting experiment for which the 
data is expected to be independent of 0. Therefore, the phys- 
ically relevant, nontrivial solution with minimum Fisher in- 
formation corresponds to K =\. Furthermore, as £12(0) is a 
function of ai • 82 = cos only, we must have = 0, tt, re- 
flecting an ambiguity in the definition of the direction of Ri 
relative to the direction of /?2- 

Choosing the solution with (j) = 71, the two-particle correla- 
tion reads 



£i2(ai,a2,Z) = -COS0 = -ai -32, 



(24) 



in agreement with the expression for the correlation of two 
5=1/2 particles in the singlet state ll3l[|47ll . 




D. Discussion 



FIG. 2. (Color online) Diagram of the Stern-Gerlach experiment. 
The source 5, activated at times labeled by n = 1,2, . . . ,A', sends 
a particle carrying a magnetic moment S to the magnet M with its 
magnetization in the direction a. Depending on the relative directions 
of a and S, the particle is detected with 100% certainty by either 
orD-. 



Ri and R2 transform the received signal into two- valued sig- 
nals, and that (iii) the i-prob describing the frequencies of 
the observed events depend on the relative orientation of the 
routers only, see Eq. (|9)- Apparently, the latter requirement 
suffices to recover the salient feature of the singlet state of 
two spin- 1/2 particles, namely that the state vector = 
(I t|> — I it>) /V2 is invariant for rotations, implying that 
its physical properties do not depend on the direction chosen 
to define "up" or "down" llsil l47ll . Realizing conditions (i) 
and (iii) in a real EPRB experiment is not a trivial matter IHSl - 



V. STERN-GERLACH EXPERIMENT 

The expression Eq. (l24l i for the correlation of the data pro- 
duced by an EPRB experiment has been obtained without 
making specific assumptions about the nature of the signals 
nor about the operation of the routers Ri and R2- In this sec- 
tion, we add some extra assumptions and we show how the 
same reasoning of Section|lV]leads to the expression of a sim- 
ple quantum mechanical model of the Stern-Gerlach magnet, 
see Fig. |2] In order to avoid repetition, in the following we 
leave out arguments/derivations/discussions which, with mi- 
nor changes have been given earlier 



We have shown that the application of our criterion of ro- 
bust, reproducible experiments to the EPRB thought experi- 
ment depicted in Fig. [T] amounts to minimizing the Fisher in- 
formation Eq. (I20I 1 for this specific problem. The result of this 
calculation is the correlation Eq. (iMt which is characteristic 
of quantum theory for the singlet state. Needless to say, our 
derivation did not use any concepts of quantum theory. Only 
plain, rational reasoning strictly complying with the rules of 
logical inference and some elementary facts about the experi- 
ment were used. 

It is most remarkable that the equations of quantum the- 
ory for a system in the singlet state appear by simply re- 
quiring that (i) everything which is known about the source 
is uncertain, except that it emits two signals, (ii) the routers 



A. Experiment 

We start by listing the assumptions about the nature of the 
signal and the action of the magnet on the signal. Specifically, 
we assume that 

a. The signal emitted by the source takes the form of a parti- 
cle which carries a magnetic moment represented by a unit 
vector S. The magnetic moment interacts with the mag- 
netic field generated by the magnet M. This field is a func- 
tion of the direction a of the magnet only. The direction of 
the magnetic moment and magnet are relative to the fixed 
laboratory frame of reference. 
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b. As the particle passes through the magnetic field, it is di- 
rected towards either or D . The mechanism which 
causes this to happen is assumed to depend on a • S = cos 9 
only. In other words, the distribution of the number of par- 
ticles detected by or D does not change (within the 
usual statistical fluctuations) if both the magnetic moment 
of the particles and the direction of the magnetic field are 
rotated by the same amount. 

c. The detectors count the particles with 100% efficiency, that 
is, if n = 1,2, .. . labels the time at which the source is 
activated, the firing of the detectors produces a data set inte- 
gers {x„ |x„ = ± 1 ; « = 1 , . . . , A^} where x„ = ±l represents 
the firing of D± . 



B. Inference-probability of the data produced by the 
experiment 

In complete analogy with Section lTVBI we have 

1. The i-prob to observe an event x = ±1 is denoted by P{x\a ■ 
S,Z) where Z represents all the conditions under which the 
experiment is performed, with exception of the directions 
a and S of the magnet and magnetic moment, respectively. 
It is assumed that the conditions represented by Z are fixed 
and identical for all experiments. It is expedient to write 
P{x\a-S) as 

/ , ^ X / , X l+xE(9,Z) 
P{x\a-S,Z)=P{x\e,Z)^ (25) 

where 

E{e)=E{a-S,Z)= ^ xP(Ai0,Z). (26) 

2. For simplicity, it is assumed that there is no relation be- 
tween the actual values of Xn and x„i if n ^ n' . In other 
words, as far as we know, each repetition of the experi- 
ment represents an identical event of which the outcome is 
logically independent of any other such event. Repeated 
application of the product rule yields 

N 

. . . ,x^|a • S,Z) = Y\P{xi\e,Z), (27) 
(=1 

meaning that the i-prob to observe the event {x\ , . . . ,XAf} is 
uniquely determined by the i-prob to observe the event x. 

C. Condition for reproducibility 

Enforcing the condition of reproducibility, exactly the same 
reasoning that leads to Eq. ( fTSl l now yields 



from which it follows that in order to minimize the variation 
of Eq. ( |28] ) as a function of e as much as possible, we should 
minimize the Fisher information 

^ ' fM^iy (29) 

The method of solution is identical to the one employed 
in Section HV CI Using the fact that E{Q) is a function of 
a • S = cos only, we find that there are two solutions, namely 
E{6) = ±cos 0. Therefore, we have 

p{x\a-S,Z)=P{x\e,Z) = ^^^, (30) 

in agreement with the expressions of the quantum theoretical 
expression for the probability to deflect the particle in one of 
the two distinct directions labeled by x = ±1 ll47ll . The ± sign 
in Eq. ( [30l ) reflects the fact that the mapping between x = ±1 
and the two different directions is only determined up to a 
sign. 



D. Discussion 

In quantum theory, Eq. (l30l l is in essence just the postulate 
(Born's rule) that the probability to observe the particle with 
spin up (corresponding to say a = +1) is given by the square 
of the absolute value of the amplitude of the wavefunction 
projected onto the spin-up state ll47ll . Obviously, as quantum 
theory is noncontextual iH H^l (the variability of) the condi- 
tions under which an experiment is carried out is not included 
in the quantum theoretical description. In contrast, in the log- 
ical inference approach, Eq. (l30t is not postulated but follows 
from the assumption that the (thought) experiment that is be- 
ing performed yields the most reproducible results, revealing 
the conditions for an experiment to produce data which is de- 
scribed by quantum theory. 



VI. PARTICLE IN A POTENTIAL: SCHRODINGER 
EQUATION 

Sections |IV] and |V] showed that, with a minimum of input 
about the nature of an experiment, simply demanding that the 
recorded data sets of events yield reproducible results for the 
i-probs, leads to expressions that are known from the quan- 
tum theoretical treatment of the experiment. In essence, these 
results derive from the following ideas: 

(i) The i-probs for events to occur obey the rules of the al- 
gebra of logical inference. 

(ii) The i-prob to observe an event (labeled by {x,y} or x) 
depends explicitly on a variable condition (represented 
by the variable 6). 
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(iii) Maximizing the robustness of the i-prob to observe the 
data with respect to small variations of the condition 
yields the functional dependence of the i-prob on this 
condition. 

This section shows that extending this approach to a particle 
in a potential is straightforward. The key points are to formu- 
late precisely what it means to perform a robust, reproducible 
experiment and to feed in knowledge about the Newtonian dy- 
namics of the particle. We consider the time-independent case 
and to keep the notation simple, we only treat the case of a par- 
ticle on a line. The extension to 2- or 3-dimensional space and 
the time-dependent case is given in Section IVlIl 



A. Experiment 

We consider the following experiment. A particle is located 
on a line segment [— L,L], relative to a fixed reference frame. 
Its unknown position is denoted by 0. We have a source that 
emits a signal which always solicits a response of the particle. 
We cover another line segment [— with 2M + 1 detectors 
of width A, where MA ~ L. The signal that arrives at the detec- 
tor — M < j < M is assumed to be particle-like, that is for each 
signal emitted by the source, only one of the 2M + 1 detectors 
actually fires. Each detector operates with 100% efficiency, 
meaning that it fires whenever a particle-like signal arrives. 

The result of a run of the experiment is a data set of detector 
clicks 



r = {jn\-M<j„<M;n^l,...,N}. 



(31) 



Denoting the total count of detector j by < kj < N, the ex- 
periment produces the data set 



.,kM\N = k^M + --- + kM}- 



It is assumed that there is no relation between the actual 
values of y'„ and y„/ if n ^ n' . In other words, each rep- 
etition of the experiment represents an identical event of 
which the outcome is logically independent of any other 
such event. As mentioned before, events with these prop- 
erties are called Bernoulli trials, a concept which is central 
to many results in probability theory lfl4l [Tsl |22| . By the 
standard combinatorial argument, the number of possible 
ways to generate the data set & is given by 



k^M^....kM\ 



(34) 



The logical consequence of the Bernoulli-trial assumption 
is then that 

P{S!\d,N,Z) = P{k^M\k-M+x e.N.Z) 
xP{k^M+u---,kM\e,N,Z) 

= p{k^M\e,N,z)p{k^M+i ,...Md,N,z) 



--NsP{-M\e,N,zf 



j=-M 



kj\ 



.P{M\e,N,zf'^ 

(35) 



In physics we often assume that space is homogeneous, im- 
plying that it does not matter where in space we perform the 
experiment. For the model at hand, this means that a trans- 
lation of the unknown position 6 and the array of detectors 
by the same distance should not affect our inferences based 
on the data. In other words, the i-prob has the property 



(32) 



Pij\e,z)^p{j + Qe + c,z), 

where ^ is an arbitrary real number 



(36) 



B. Inference-probability of the data produced by the 
experiment 

A priori, the relation between the unknown location 9 of 
the particle and the location j of the detector which fires is 
unknown. Therefore, to describe this relation, we introduce 
the i-prob P{j\9,Z) that the particle at unknown location 9 
activates the detector located at the position — M < j <M. As 
before, the conditions represented by Z are fixed and identical 
for all experiments. As in Sections|IV]and |V] the key question 
is what the requirement of reproducibility tells us about the i- 
prob P{j\9,Z) as a function of 9. Note that unlike in the case 
of parameter estimation, in the case athandbothP(y|0,Z) and 
the parameter 9 are unknown. 

The following assumptions are essentially the same as those 
of Sections FlV Bl and IV Bl and are listed here for completeness. 

1. For fixed position 9, the i-prob to observe the data is given 
by 



P{S>\9,N,Z) = P{k^M, . . .,kM\0,N,Z). 



(33) 



C. Condition for reproducibility 

Comparing Eq. ( l35T l and Eq. ( fT2] i. it is not a surprise that by 
simply repeating all the steps that lead to Eq. ( fTSl l. the condi- 
tion for reproducibility applied to Eq. (|35t yields the evidence 



Ev = 



M 



]^^PU\e,z) V 



f dPU\9,Z) 
39 



{£')■ (37) 



At this point, to make contact with the Schrodinger equation 
which is formulated in continuum space, it is necessary to re- 
place Eq. ( |37] | by its continuum limit 



Ev== 



dx 



1 



P{x\9,Z) 



dP{x\9,Z) 
39 



me'), 



(38) 

where we have assumed that the width of the detectors ap- 
proaches zero (A 0) and the length of the line segment ap- 
proaches infinity (L °°). Making use of translational invari- 
ance (see Eq. ( |36] |) we have 



dPix\9,Z) 
39 



lim 

5^0 



P{x\9 + 5,Z)-P{x\9,Z) 
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lim 

5^0 



p{x-8\e,z)-p{x\e,z) 



dp{x\e,z) 

dx 



(39) 



Hence, we may replace the partial derivative with respect to 9 
by the partial derivative with respect to x, yielding 



Nf 

Ev = -^/H0) + ^(e'), 



where 



/f (0) = / dx 



Pix\e,z) 



dp{x\e,z) 

dx 



(40) 



(41) 



denotes the Fisher information of the experiment considered 
in this section. Obviously, minimizing Eq. PTt as we did for 
the EPRB and Stern-Gerlach problem cannot yield a solution 
which incorporates the fact that the particle moves in a po- 
tential simply because this knowledge is not yet built into the 
minimization problem. 

According to classical mechanics the orbit in phase space 
of a particle is given by the solution of the time-independent 
Hamilton-Jacobi equation (HJE) 



1 

2m 



dsje) 
de 



-Vie) 



0, 



(42) 



where m, S{0),V{9) andE denote the mass of the particle, the 
action (Hamilton's principal function), the potential, and the 



energy, respectively. Note that 9 represents the position of the 
particle which, in classical mechanics, is assumed to be known 
with certainty. The HJE describes experiments for which there 
is no uncertainty about each individual event (category 1). 

If there is uncertainty about the position 9 but not about 
the individual event (category 2), this uncertainty may be 
captured by assuming that the i-prob P{x\9,Z), has a par- 
ticular functional dependence, e.g. P{x\9,Z) = expl—ix-- 
0)^/2(7^]/\/27ra2. Given that the equation which determines 
the action S{x) should reduce to Eq. ( |42] | in the limit that 
P{x\9,Z) 5{x~ 9), the simplest equation reads 



dx 



dS{x 



^2m[V{x) -E] 



P{x\9,Z)=0. (43) 



In words, Eq. ( l43T l tells us that the inference drawn from the 
distribution of detector clicks as function of their location on 
the line, is that, on average, these locations satisfy the time- 
independent HJE. 

Finally, if there is uncertainty about the individual event as 
well as about the conditions (category 3), the i-prob P{x\9,Z) 
is unkown but can be determined by requiring that the fre- 
quency distributions of the observed events are robust (cate- 
gory 3a). It is important to note that in this case, there is no 
assumption about the unknown position 9 of the particle. 

Inspired by Schrodinger's original derivation [51| of his 
equation (see Section |V1 EK we minimize the Fisher infor- 
mation Eq. i4J\ with the constraint that the time-independent 
HJE only holds on average. Specifically, the functional to be 
minimized reads 



Fi9) 



dx 



1 



P{x\9,Z) 



dP{x\9,Z) 



dSjx) 
dx 



+ 2m[V{x)-E] 



Pix\9,Z) 



(44) 



where A is a Lagrange multiplier It is important to note that without changing the minimization problem, we may substitute 
P{x\9,Z) — >■ aP{x\9,Z) where a is any nonzero real number Therefore, any solution for P{x\9,Z) obtained by minimizing 
Eq. ( |44] | can be normalized by P{x\9,Z) P{x\9,Z)/ f^^dx P{x\9 ,Z). Hence, there is no need to introduce a Lagrange 
multiplier to impose the normalization condition on P{x\9,Z). 

We do not know of any direct analytical method to solve the nonlinear minimization problem Eq. (l44t . However, from 
Madelung's hydrodynamic-Uke formulation |52| or Bohm's interpretation llssll of quantum theory it follows that the extrema 
(and therefore also the minima) of Eq. (|44] | can be found by solving the time-independent Schrodinger equation. 

With a minimum of algebra this can be shown as follows. We start from the functional 



QiO) 



dx i 4 



,<9i/*(x|0,Z) d\if{x\9,Z) 



Substituting 



\j/ix\9,Z) = ^P{x\9,Z)e' 



' 2mA [V (x) - E] V/* {x\9,Z)\f/{x\9, Z) 



(45) 



(46) 



yields Eq. ( l44b . 

On the other hand, from a standard calculation using the variation \i/*{x\9,Z) — > \jf*{x\9,Z) + 5\j/*{x\9,Z), it follows that the 
extrema of Eq. ( l45T l are given by the solutions of the linear eigenvalue problem. 



d^\f/{x\9,Z) mX 



dx^ 



— [y(x)-£]v/(x|0,Z)=O, 



(47) 
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which is nothing but the time-independent Schrodinger equation with A = 4/7j^. Planck's constant Ti enters here because of 
dimensional reasons (see also Section lVlEb and it sets the energy scale of experiments which belong to category 3a. As Eq. (|47] | 
is a linear second-order partial differential equation, in practice computing its solution requires the specification of two boundary 
conditions on v/(x|0,Z). 

I 



In establishing the equivalence between Eq. ( |44] | and 
Eq. (l45T l. we introduced the complex-valued function \lf{x) to 
represent the two real-valued functions P{x\9,Z) and S{x). 
However, in the case that the solutions of Eq. ( |47] i are real- 
valued, we have S{x) = mod 2n. Hence, it would be suffi- 
cient that v/(x) is a reaZ-valued function. On the other hand, 
it is a simple matter to repeat the derivation and show that 
minimization of the Fisher information with the constraint 
that on average, the HJE of a particle in an electromagnetic 
field should hold leads to the corresponding time-independent 
Schrodinger equation (see also Section lVirt . Then, in general, 
it is necessary to introduce a complex-valued function i//(x) to 
linearize the minimization problem. 



D. Discussion 

Starting from the assumptions that the experiment belongs 
to category 3a and averages of the observed data complies 
with Newtonian mechanics, application of logical inference 
straightforwardly leads to the time-independent Schrodinger 
equation Eq. (|47] |. The key step in this derivation, which in 
essence is the same as in Sections HVl and IVl is to express the 
robustness of the observed data (distribution of frequencies of 
the events) with respect to small variations in the unknown po- 
sition of the particle, taking into account the inference that we 
draw on the basis of the observed data, namely that on average 
there is agreement with Newtonian mechanics. 

Of course, a priori there is no good reason to assume that on 
average there is agreement with Newtonian mechanics. The 
only reason to do so here is that only then we recover the time- 
independent Schrodinger equation. In other words, the time- 
independent Schrodinger equation describes the collective of 
repeated experiments of category 3a subject to the condition 
that the averaged observations comply with Newtonian me- 
chanics. The question what kind of equations are obtained by 
assuming a different kind of "mechanics" is out of the scope 
of the present paper 

It is very important to emphasize that from the logical- 
inference viewpoint the superposition principle, that is, the 
linearity of the Schrodinger equation, is not fundamental but 
follows from the fact that in classical mechanics the kinetic 
energy is quadratic in the velocities and, thus, in the momenta. 
Only in this case the substitution Eq. ( |46] l reduces the nonlin- 
ear minimization problem to a Unear equation. This raises 
the question what to do with different types of classical me- 
chanics, such as relativistic mechanics. It is well known, that 
relativistic quantum mechanics cannot be mechanics, it can 
only be afield theory ll54l - l56ll . the argument being that any at- 
tempt to measure the coordinate of a particle with the accuracy 
better than its Compton wavelength unavoidably leads to the 
creation of particle-antiparticle pairs. We leave the challeng- 



ing problem of extending the present work to the relativistic 
domain for future research. 

A comment on the identification X = A/Tr' is in order 
Clearly, from a dimensional analysis of Eq. (l47i . A has to be 
a parameter with the dimension of h^^ but there is no a-priori 
reason why we must have X =4//!^. However, comparing the 
results of a numerical calculation based on Eq. (|47T i with spe- 
cific experimental results for the spectra of atoms etc., we are 
forced to choose X ~ A/Tp'. It is worth mentioning here that 
the logical-inference derivation of the canonical ensemble of 
statistical mechanics [23i, .241 employs the same reasoning to 
relate the inverse temperature j3 = l/^^r to the average ther- 
mal energy. 

A very important point, which renders our treatment very 
different from other statistical formulations of quantum the- 
ory 10, IstI - ItoIi is that the unknown position of the particle 
never appears in the solution of the problem. It appears as a 
condition on the i-probs but it has no effect on the functional 
dependence of the i-probs on the relevant, observable coordi- 
nate X. 

From quantum theory we know that Eq. (|47] | usually has 
more than one solution, the minimum of Eq. ( |45] | correspond- 
ing to the quantum state with the lowest energy and the oth- 
ers being excited states. The latter correspond to extrema of 
Eq. ( l44b with values of F{Q) that are larger than the minimum 
value of F{Q). 

It is easy to show, directly from Eq. (l44l l. that at an ex- 
tremum (with respect to variations in f (x|0,Z) and S(x), not 
to 9) the derivative of F{d) with respect to 9 is zero, that is 



dF{e) 



de 



= 0. 



(48) 



Extremum of 



In other words, the excited quantum states describe experi- 
ments which are not the "most" robust against small changes 
of Q but nevertheless have the property that, to first order, the 
"quality" of the results (i.e. averages, etc.) does not depend 
on the particular value of 0. 

If we were to follow the tradition of conventional statistics, 
we would introduce, for instance, an estimator {x) for and 
assume that the expectation value of this estimator relates to 
the "true" position of the particle. As we know from the early 
days of the development of quantum theory iItiIi trying to in- 
terpret such estimators as objective properties of the particle 
creates seemingly endless possibilities for different interpre- 
tations, paradoxes, and mysteries From the viewpoint of 
logical inference, Q was and remains unknown and any at- 
tempt to interpret the function (//(x) seems superfluous; ^/(x) 
is just an extremely useful vehicle to compute the numerical 
values of the i-probs P(x|0,Z). 
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E. Historical note 



A. Experiment 



It is of interest to repeat here the first few steps in 
Schrodinger's first paper on his equation llsil . For simplic- 
ity, we consider a particle moving on a line only. Schrodinger 
starts from the time-independent HJE. 



Hix, 



where 



Hlx, 



dSjx) 
dx 



dSjx) 
dx 



1 

2m 



dS{x) 



'Vix), 



(49) 



(50) 



is the Hamiltonian of the classical, Newtonian particle. Then, 
in Eq. ( |49l ) he substitutes 



Six)=Klogxi/{x), 



(51) 



where y/{x) is assumed to be a real single-valued function of 
X and K isa constant with the dimension of action and obtains 



Hlx, 



K d\if{x) 



= E. 



Then, Schrodinger observes that one can rewrite Eq. 
quadratic form in ^{x), namely 

2 



fd\lf{x) 
2m \ dx 



+ [V{x)~E]\i/^{x)^0. 



(52) 



as a 



(53) 



Of course, solving Eq. (|53] | for \lf(x) does not bring anything 
new. Therefore, Schrodinger postulates that instead of solving 
Eq. (|53] |. one should search for the extrema of the functional 



Q = 



dx 



2m \ dx 



+ [V{x)-E]xi/\x) 



(54) 



knowing that the formal solution of this variational problem 
leads to an eigenvalue problem. He then continues to show 
that by using the classical Hamiltonian for the Kepler prob- 
lem, the solution of the eigenvalue problem yields the spec- 
trum of the hydrogen atom. 

It is quite remarkable that in his next publication on the sub- 
ject 1I72I1 . Schrodinger calls both the ansatz Eq. (ISTT i and the 
transition from Eq. (l5Jt to Eq. (l54t incomprehensible ("un- 
verstandlich") and then goes on to motivate his equation us- 
ing the analogy with optics. As shown by our derivation of 
Eq. (07]), which on choosing A = is the same as Eq. ( |53] ). 
from the viewpoint of logical inference applied to experiments 
of category 3a, there is nothing incomprehensible to Eq. ( |54] i. 



VII. TIME-DEPENDENT SCHRODINGER EQUATION 

Extending the reasoning which yields the time-independent 
Schrodinger equation to the time-dependent, multidimen- 
sional case does not require new concepts but simply replac- 
ing the position on the line by a vector in 3D space and 
adding time labels does not suffice. Therefore, in what fol- 
lows, we focus on those aspects which are absent in the exam- 
ples treated in Sections UVUVII 



We consider repetitions of a thought experiment on a par- 
ticle moving in a -dimensional hypercube CI of linear extent 
[— L,L], relative to a fixed reference frame. Here and in the 
following c/ is a positive integer A source emits a signal at dis- 
crete times labeled by the integer t = 1 , . . . ,M. It is assumed 
that for each repetition, the particle is at the unknown position 
dr £ CI. As the particle receives the signal, it responds by 
emitting another signal which is recorded by an array of de- 
tectors. For each signal emitted by a particle the data recorded 
by the detector system is used to determine the position _/'„ ^ of 
a voxel of linear extent [—A, A] in the cZ-dimensional space £1. 
The dimension of the voxels determines the spatial resolution 
of the detection system. As in Section lVTl in a later stage, we 
will let A — > to solve the problem analytically. 

The result of repetitions of the experiment yields the data 
set 

T = {j„,r\j„.r e hL',L% « = 1, . . . ,A?; T = 1 , . . . ,M}, 

(55) 

or, denoting the total counts of voxels j at time t by < ^ < 
A^, the experiment produces the data set 

,M;N= ^ kj,,\. (56) 

j€l-Ld,L'l] 



1, 



B. Inference-probability of the data produced by the 
experiment 

In analogy with the procedure followed in the previous sec- 
tions, we introduce the i-prob P{j\6 ,T,Z) to describe the re- 
lation between the unknown location 6 and the location j of 
the voxel determined by the detector system at discrete time 
T. Except for the unknown location 6, all other experimental 
conditions are represented by Z and are assumed to be fixed 
and identical for all experiments. Note that unlike in the case 
of parameter estimation, in the case at hand both P{j\6 ,Z,Z) 
and the parameter 6 are unknown. As in all examples treated 
so far, the key question is what the requirement of repro- 
ducibility tells us about the i-prob P{j\9,z,Z) as a function 
of 0. 

The following assumptions are essentially the same as those 
of Sections HVBirVBl and lVTBl 

1 . It is assumed that each repetition of the experiment repre- 
sents an identical event of which the outcome is logically 
independent of any other such event. By application of the 
product rule, the consequence of this assumption is that 



M 



P{&\di,...,dM,N,Z) 



^'D n 

T=l je[-L'',L''\ 



P{j\er,T,Zfj 



(57) 



2. As in Section IVIBI we assume that space is homogeneous. 
This implies that the i-prob has the property 

P(7|0,Z)=P(i+C|0 + C,Z), (58) 
where ^ is an arbitrary vector in (/-dimensional space. 
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C. Condition for reproducibility 



to Eq. ( fTsT i, we find that 



Ev = EE 



e,e,v dP{j\e„T,Z) dPU\er,T,Z) 



In Sections lrVBllV Bl and l VI BI the variable condition is a 
scalar variable whereas in the present case, 9 denotes a collec- 



(59) 

where we have dropped the irrelevant prefactor —N/2 and 
tion of d scalars. This has some impact on the expression for omitted from the summation sign the range of T and j (see 
the evidence. Repeating the steps that took us from Eq. (fl4] i Eq. (|57] |) and the terms of third and higher order in the e's. 

I 



The condition for reproducibility applied to Eq. (|59) requires that we minimize Ev (which is non-negative, see Eq. (|60ll). A 
minor problem thereby is that the e^-'s are arbitrary (but small) but we can get around this problem by noting that 



dP{j\er,T,Z) 



EiEi, dP{j\er,T,Z) dP{j\er,T,Z) 



^f-^pU\e,,T,z) 



ddi 



39:, 



1 



f dp{j\er,T,z y 



(60) 



r 



where we used the Cauchy-Schwarz inequaUty. From 
Eq. ( l60l l. it follows that as the e,'s are arbitrary, minimizing 
the rightmost factor in Eq. ( |60l l is the best we can do to make 
sure that Ev is as small as possible. Therefore, we find that 
in order to realize the condition for reproducibility we have to 
minimize the Fisher information 



If 



d 

EE 



1 



fdp{j\e„z,z) 



f-^fiPij\er,T,z) V 



(61) 



subject to additional constraints that we impose (see below). 

As before, to make contact with the Schrodinger equation 
which is formulated in continuum space-time, it is necessary 
to replace sums over space-time coordinates by integrals. In- 
voking translational invariance (see Section IVIIBb . we have 



1 



where x = (xi , . . . ,x^i). 



dp{x\e{t),t,z) 

dxi 



(62) 



We include the knowledge that the particle moves in a time- 
dependent electromagnetic field and time-dependent potential 
by repeating the steps of Section|Vl]that lead us from Eq. (l42l i 
to Eq. ( l44l i. that is we start from the classical HJE and then 
account for the uncertainties about the events. 

According to classical mechanics, the motion of a particle 
with mass m in a time-dependent electromagnetic field and 
time-dependent potential is governed by the time-dependent 
HJE 



-V(0,f)=O, (63) 



where q denotes the electrical charge of the particle, c is the 
velocity of light in vacuum, A{x,t) represents the vector po- 
tential and the electrical potential and all potentials of non- 
electromagnetic origin are collectively denoted by V{x,t). 

Using the same argument as the one in Section |VI] if there 
is uncertainty about the position 6 but not about the individ- 
ual event (category 2), the simplest equation for S{x,t) which 
reduces to Eq. (|63] | in the limit that there is no uncertainty 
reads 



dx 



dS{x,t) 



dt 



— V5(jc,0--A(x,f) 
Zm \ c 



-V{x,t) 



p{x\e{t),t,z)^Q, 



(64) 



for each value of t. If there is uncertainty about both the 
individual event and the conditions (category 3), the i-prob 
P{x\0{t),t ,Z) is unkown but can be determined by requiring 
that the frequency distributions of the observed events are ro- 
bust (category 3a). Note that no assumption about the un- 



I 

known position 6 of the particle has (or will) been made and 
that this Une of reasoning, which is reminiscent of Ehrenfest's 
theorem iFtsIi . does not determine P{x\d{t),t ,Z) but merely 
provides a constraint on it. 

Minimizing the Fisher information Eq. (|62] | with the con- 
straint Eq. (|64] i amounts to minimizing the functional 
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F ^ dx dt 



1 



P(x|0(O,f,Z) 



dp{x\e{t),t,z) 



dS{x,t) 1 fdS{x,t) q 



dt 



2m 



A{x,t) +V{x,t] 



p(x\eit),t,z) 



(65) 



where A is a Lagrange parameter and the normalization of P{x\6{t),t,Z) can be taken care of by exploiting the invariance of the 
extrema of Eq. d&Sl l with respect to the rescaling transformation P{x\d{t),t,Z) — > aP{x\d{t),t,Z). Note that the integrand of 
Eq. i65[ is invariant for the gauge transformation A{x,t) — > A{x,t) + (q /c Wx ixj), V{x,t) — ?> V{x,t) — {q/c)dx{x,t)/dt, and 
S(x,t) S{x,t) + {q/ c)x{x,t) where x(.Y,r) is an arbitrary scalar function ||74|| . 

In analogy with Section [VTl it follows that finding the extrema of the functional Eq. (l65l l is tantamount to solving the time- 
dependent Schrodinger equation (TDSE). Applying the standard variational argument, it follows that the solutions of the TDSE 



ih 



d^if{x\d{t),t,Z) 
dt 



2m V (^Xi 



iq 
tic 



A(x,f) +V{x,t) 



V/W0(f),f,Z), 



(66) 



are the extrema of the functional 
[ dx [ dt \ 2m;VX 



dt 



d 



' d^*{x\e{t),t,Z) ^ iq^/X 
dxj 2c 



Aj{x,t)xi/*ix\dit),t,Z) 



' d\(f{x\B{t),t,Z) iqy/X 



dxj 



2c 



Aj{x,t)\l/{x\e{t),t,Z) 



-2mXV{x,t)\ir*{x\e{t),t,Z)\if{x\e{t),t,Z) 



(67) 



if A = 4/tP'. The equivalence of Eq. ( |65] ) and Eq. ( |67] | follows 
by substituting y/{x\e{t),t,Z) = ^P(jt:|e(f),f ,Z)e''^(-'''''^/2. 
Note that the solutions of Eq. ( |66] l do not depend on the un- 
known position 6{t), as it should be. 

The functional Eq. ( |67] i inherits from Eq. (|65t the in- 
variance under gauge transformations. Specifically, it is 
easy to show that the integrand in Eq. ( |67] | does not 
change by substituting A{x,t) ^ A{x,t) + (^\/X/2c) V j (x, f ), 
V{x,t) ^ V{x,t)~- {q/c)dx{x,t)/dt, and \lf{x\d{t),t ,Z) 
\lf{x\6{t),t,Z)ex^[iq^fXx{x,t)/2c] where x(x,f) is an arbi- 
trary scalar function. 

Instead of solving the nonlinear differential equation that 
follows from extremizing Eq. (l65t . it is usually more expe- 
dient to solve the linear partial differential equation Eq. ( |66] ). 
Of course, in practice we need to specify v/(jc|9(f),f ,Z) at 
f = in order to solve the initial-value problem Eq. (l6&t . 
Unlike in the time-independent case (see Section IVII ) where 
we may have solutions for which S{x) ~ mod 2%, in the 
general case, the equivalence between Eq. ( |65] | and Eq. (l67T i 
cannot be established unless we allow \//(jc|6(f),f ,Z) to be 
complex-valued. In general, minimizing Eq. ( l65l l yields so- 
lutions for the two real-valued functions P(jc|0(f),f ,Z) and 
S{x,t) and although we can represent these two function in a 
variety of ways, the complex-valued representation in terms of 
V/(jt|9(f),f,Z) offers the for computational reasons very im- 
portant advantage that it transforms a nonlinear optimization 
problem into a linear one. 

The equivalence of Eq. ( |65l l and Eq. WT\ allows us to 
determine, from the solutions of the TDSE, the i-probs 



P(x|0(f),f ,Z) which yield the most likely and most repro- 
ducible data, collected in the experiment described in Sec- 
tion IVll A| Put differently, through the TDSE, quantum the- 
ory describes an experiment which yields data that is the most 
robust with respect to small variations of the external condi- 
tions (the unknown positions of the particle) under which the 
experiment is being performed. 

D. Discussion 

In essence, all the points that were mentioned in the discus- 
sions in Sections JV] - [Vl] also hold for the time-dependent 
case. Of course, one should replace for instance "time- 
independent" by "time-dependent", but otherwise there are no 
significant conceptual changes. 

There is a considerable body of work |0 IstI - ItoIi which 
shows that quantum theory can be cast into a "classical" sta- 
tistical theory defined by the functional Eq. (l65T l (or variations 
thereof). The idea that the Fisher information may play an im- 
portant role in building this relationship appears, to the best of 
our knowledge, for the first time in a paper by Frieden iIstI . 
Furthermore, the Heisenberg-Robertson inequalities, often re- 
garded as a landmark of quantum mechanics, directly follow 
from the Cramer-Rao inequaUty L57, 60-62, 65^ IH, a 
standard result in probability theory ll62fl . 

From a general perspective, it is instructive to compare the 
methodology adopted in the present paper with the one of the 
earlier works 10, dThTOII in which, in essence, it is postulated 
that the Fisher information is the basic expression from which 
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the equations of theoretical physics can be derived. The ex- 
pressions of functionals akin Eq. ( |65T l are justified using ar- 
guments from estimation theory, and concepts such as intrin- 
sic fluctuations and "smart measurements". Thereby, it seems 
essential that the difference between the parameter to be esti- 
mated (e.g. 6{t) in our notation) and the measured quantity 
(e.g. X) may be interpreted as intrinsic fluctuations. 

Taking Frieden's treatment of the EPRB experiment as an 
example ll62ll . it seems far-fetched to regard the angle 6 as 
the variable-to-be-estimated. Indeed, we do not know of any 
real EPRB experiment which attempts to estimate this angle. 
Moreover, mathematically we cannot even define the differ- 
ence between the event {x,y} and the angle 9, let alone that 
we can interpret this difference as a signature of intrinsic fluc- 
tuations. Yet, as we have shown in Section lTVl straightforward 
application of logical inference to an experiment assumed to 
belong to category 3a, effortlessly yields the equations of the 
quantum theoretical description for this experiment. 

In the logical inference approach, the Fisher information is 
not postulated to be the key concept but appears as a result of 
expressing the requirement that the experiment yields repro- 
ducible, robust results, not only for the EPRB experiment but 
for all examples discussed in this paper 

In short, in the logical inference approach the expression to 
be minimized is not postulated but it is derived by assuming 
that the theory describes reproducible experiments in the most 
robust possible way. 



VIII. CONCLUSION 

We have shown that the basic equations of quantum theory 
derive from logical inference applied to experiments in which 
there is uncertainty about individual events but for which the 
frequencies of events are reproducible and most insensitive to 
small variations of the unknown factors. 

The derivations presented in Sections HVTlVllI demonstrate 
that logical inference, that is plausible reasoning, applied to 
experiments which belong to 

Category 3a. There is uncertainty about each event, the con- 
ditions under which the experiment is carried 
out may be uncertain, and the frequencies with 
which events are observed are reproducible and 
robust against small changes in the conditions, 

yields two important, general results. 

The first is the justification of the intuitive procedure to as- 
sign to the i-probs the frequencies for the events to occur For 
fixed experimental conditions, the usual argument for adopt- 
ing this assignment is that it maximizes the i-prob to observe 
these frequencies (see Appendix). On the other hand, it is 
quite natural to expect that under variable experimental con- 
ditions it is the most robust, reproducible experiment which 
produces the most likely frequencies of events. Obviously, 
the argument based on reproducibility under variable experi- 
mental conditions is more general as it contains the condition 
affixed experimental conditions as a special case. 



The second, and most important for the purpose of recov- 
ering the quantum theoretical description as an application of 
logical inference, are equations that determine the functional 
dependence of the i-probs on the condition that is considered 
to be variable. Application of exactly the same procedure 
to the Einstein-Podolsky-Rosen-Bohm experiment, the Stern- 
Gerlach experiment, and experiments on a particle in a po- 
tential demonstrate that the equations known from the quan- 
tum theoretical description of these experiments follow in a 
straightforward manner without invoking concepts of quan- 
tum theory. 

The key point in the derivation of the quantum theoretical 
description is to express precisely and unambiguously, using 
the mathematical framework of plausible reasoning |[l2l - [l6ll . 
the essential features of experiments belonging to category 3a. 
Adding the requirement that the experimental results are in- 
sensitive to small changes of the conditions under which the 
experiment is carried out not only yields the equations known 
from quantum theory but also explains, not postulates, why 
quantum theory is noncontextual 111]. Furthermore, it also ex- 
plains that if it is difficult to engineer nanoscale devices which 
operate in a regime where the data is reproducible, it is also 
difficult to perform these experiments such that the data com- 
plies with quantum theory. 

The logical-inference methodology to derive the basic 
equations of quantum theory has some implications for inter- 
pretational aspects of quantum theory. First, although it sup- 
ports Bohr's view expressed in quotes (1-3) of the introduc- 
tion, it does not support the Copenhagen interpretation (in any 
form) 01]. Indeed, the wave function Eq. (|46] l merely appears 
to be a purely mathematical vehicle to turn nonlinear differen- 
tial equations into linear ones and it seems difficult to attribute 
more meaning to such a vehicle other than mathematical. On 
the other hand, there is no conflict with the statistical interpre- 
tation ll47ll75ll if we ignore the conceptual difference between 
i-probs and "mathematical" probabilities. Second, it follows 
that quantum theory is a "common sense" description of the 
vast class of experiments that belongs to category 3a. Quan- 
tum theory definitely does not describe what is happening to a 
particle, say. This follows most clearly from our derivation of 
the Schrodinger equation, which shows that quantum theory 
does not provide any insight into the motion of a particle but 
instead describes all what can be inferred (within the frame- 
work of logical inference) from or, using Bohr's words, said 
about the observed data, in complete agreement with Bohr's 
view expressed in quotes (1-3) of the introduction. 

The logical-inference derivation of the quantum theoreti- 
cal description does not, in any way, prohibit the construc- 
tion of cause-and-effect mechanisms that, when analyzed in 
the same manner as in real experiments, create the impression 
that the system behaves as prescribed by quantum theory [f76l - 
ItsIi . From Bohr's quote (1) reproduced in the Introduction, 
and as demonstrated in a mathematically rigorous manner in 
the present paper, quantum theory is but an abstract descrip- 
tion, be it a very powerful one. As mentioned in Section HIH 
it is straightforward to construct computer simulation models 
that mimic, for all practical purposes almost perfectly, experi- 
ments that belong to category 3a. Work in this direction, for a 
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review see Ref.|79l has shown that it is indeed possible to build 
simulation models which reproduce, on an event-by-event ba- 
sis, (quantum) interference and entanglement phenomena. 

Summarizing: In line with Bohr's statement that "Physics 
concerns what we can say about nature IJ]", the aim of 
physics is to provide a consistent description of relations be- 
tween certain classes of events. Some of these relations ex- 
press cause followed by an effect and others do not. If there 
are uncertainties about the individual events and the condi- 
tions under which the experiment is carried out, situations 
may arise in which it becomes difficult or even impossible 
to establish relations between individual events. In the case 
that the frequencies of these events are reproducible, it may 
still be possible to establish relations, not between the individ- 
ual events, but between the frequency distributions of the ob- 
served events. As we have demonstrated, it is precisely under 
these circumstances that the apphcation of logical inference to 
the (abstraction of) the experiment yields the basic equations 
of quantum theory. This then also explains the reason for the 
extraordinary descriptive power of quantum theory: it is plau- 
sible reasoning, that is common sense, applied to reproducible 
experimental data. The algebra of logical inference facilitates 
this reasoning in a mathematically precise language which is 
unambiguous and independent of the individual. 
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nlP{Oi\N,Z) < {n\ + l)P{Ot\N,Z), (A.3) 
for all 2 < /: < m. Summing over all k yields 

m m 

P{0,\N,Z) Y^nl< K + 1) ^ P{Ok\N,Z), 

k=2 k=2 



or 



P{0,\N,Z){N -n\) <{n\ + m~P{0,\N,Z)), 



or 



P(Ot|A^,Z)<|±l. 



(A.4) 



Similarly, if we consider 



P{n\,...,nl...Xn\N,Z) _ nl + \P{Or\N,Z) 

P{n\-\,...,nl + \,...,n%,\N,Z) n\ P(Oi|A^,Z) " ' 

(A.5) 

we find 



P(px\N,Z)> 



N + m- 1 



(A.6) 



In the derivation that leads to Eqs. (I A. 4b and (IA.6b . our choice 
of the pair {\,k) (2 < k) was arbitrary. Repeating the deriva- 
tion for 1 < j,k <m with k j yields 
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n* n* + 1 
^ < PiO,\N,Z) < , l<j< m, (A.7) 

or equivalently 



Appendix: Maximum of the inference-probability 

We consider an experiment with logically independent out- 
comes 6*1, .. . ,0i„ which is repeated times under constant 
conditions represented by the proposition Z. The i-prob that 
the outcome (9jt occurred times reads 



P(«1,...,«„,|A?,Z): 



-P{Oi\N,zy^...P{0,„\N,Z)""'. 

(A.l) 

Let us denote the set of values of {«i , . . . ,«„,} which maxi- 
mizes . . . ,nn,\N,Z) by {«|, . . . ,«,*„}. Then, we must have 

P(«*,...,4,...,n,;|jV,Z) _ n\ + lP{0,\N,Z) 

P{n\ + l,...,nl-l,...,n*jN,Z) «* PiQi\N,Z) - ' 

(A.2) 



1 n*: / TO- 2 

P(Oj\N,Z) < — ^ < P(Oi\N,Z) 1 + 

^ ' ^ N+l - N+l - ^ ' \ N+l 

(A.8) 

for all I < j < m. 

For sufficiently large A^, it follows from Eq. ( IA.7b that for an 
experiment with logically independent outcomes Oi, . . . ,0„, 
which is repeated times under constant conditions repre- 
sented by the proposition Z, the assignment 



P{Oj\N,Z) 



^<j<m, (A.9) 



maximizes the i-prob that Oj occurs iij times for all I <j <m. 

The derivation of the assignment Eq. ( |A.9b justifies the intu- 
itive procedure to take as the numerical values of the i-prob s, 
the frequencies of occurrences, if the latter are known through 
actual measurement. 
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